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Abstract 

Background: As yet, few genomic resources have been developed in crustaceans. This lack is particularly evident 
in Copepoda, given the extraordinary numerical abundance, and taxonomic and ecological diversity of this group. 
Tigriopus californicus is ideally suited to serve as a genetic model copepod and has been the subject of extensive 
work in environmental stress and reproductive isolation. Accordingly, we set out to develop a broadly-useful panel 
of genetic markers and to construct a linkage map dense enough for quantitative trait locus detection in an 
interval mapping framework for T. californicus-a first for copepods. 

Results: One hundred and ninety Single Nucleotide Polymorphisms (SNPs) were used to genotype our mapping 
population of 250 F 2 larvae. We were able to construct a linkage map with an average intermarker distance of 1.8 
cM, and a maximum intermarker distance of 10.3 cM. All markers were assembled into linkage groups, and the 12 
linkage groups corresponded to the 12 known chromosomes of T. californicus. We estimate a total genome size of 
401.0 cM, and a total coverage of 73.7%. Seventy five percent of the mapped markers were detected in 9 
additional populations of T. californicus. Of available model arthropod genomes, we were able to show more 
colocalized pairs of homologues between T. californicus and the honeybee Apis mellifera, than expected by chance, 
suggesting preserved macrosynteny between Hymenoptera and Copepoda. 

Conclusions: Our study provides an abundance of linked markers spanning all chromosomes. Many of these 
markers are also found in multiple populations of T. californicus, and in two other species in the genus. The 
genomic resource we have developed will enable mapping throughout the geographical range of this species and 
in closely related species. This linkage map will facilitate genome sequencing, mapping and assembly in an 
ecologically and taxonomically interesting group for which genomic resources are currently under development. 



Background 

Copepods are among the most abundant metazoans on 
the planet and include more than 11,500 described spe- 
cies [1]. They are ecologically important, spanning a 
broad range of marine, freshwater and moist terrestrial 
habitats and serving as major players in global energy 
transfer, including the largest portion of carbon turnover 
in the oceanic food web [2]. They are also of economic 
interest, serving as both prey and parasites of valuable 
fishery species. Despite their ecological and economic 
importance, there are few genetic resources developed 
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for copepods, and existing models are likely to be of 
limited use in this group. 

The overwhelming majority of arthropod model sys- 
tems are insects. Additionally, Daphnia pulex (a bran- 
chiopod crustacean) has recently been sequenced [3]. 
However, neither insects nor Daphnia are phylogeneti- 
cally close to copepods. While the precise topology of 
the arthropod phylogeny is currently debated, the split 
between Insecta, Copepoda and Branchiopoda is deep in 
the Cambrian. Shrimp (decapod crustaceans) are some- 
what more closely related to copepods and have been 
the subject of genetic work [e.g. 4, 5] due to their eco- 
nomic importance in the seafood market. However, like 
other decapods, shrimp have high rates of genomic 
duplication, complicating genomic analyses and render- 
ing them less useful as models [6-8]. 
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Tigriopus californicus is an increasingly attractive 
genetic model copepod and model crustacean. It has a 
history as an experimental system for environmental 
stress, ecotoxicology and speciation [9-12], is easily cul- 
tured in the lab, and has an experimentally tractable life 
cycle. It also has one of the smaller known crustacean 
genomes-a full order of magnitude smaller than the 
tiger shrimp Penaeus monodon [5]. 

T. californicus has an extensive range along the west 
coast of North America, from Alaska to Baja California. 
It is a supralittoral marine harpacticoid copepod, inha- 
biting splash pools of the rocky shore. It has extremely 
limited gene flow, with strong genetic differentiation 
between even geographically close populations [13,14]. 
Populations from Santa Cruz and San Diego, California 
are the focus of concerted efforts for the development 
of genetic resources by several labs. The transcriptomes 
of the two populations have recently been published 
[10]. A linkage map comprising 8 microsatellite markers 
and 45 nuclear SNPs has also recently been constructed 
[15] with a corrected map length of 484.8 cM, where 
the 11 linkage groups and one unlinked marker appar- 
ently corresponded to the 12 chromosomes of T. califor- 
nicus [16]. 

A linkage map is a reference point for understanding 
the genetic basis of phenotypic variation. It is a valuable 
tool for genome-wide scans of genetic variation, for dis- 
section of QTL (quantitative trait loci) and for the 
assembly of a physical map. By focusing on developing 
gene-based markers, our map will facilitate comparative 
studies using orthologous genes. 

We here develop a linkage map between the Santa 
Cruz and San Diego populations, comprising 190 SNP 
markers. We validate these markers in 9 additional, geo- 
graphically distinct populations of T. californicus. We 
also screen the markers in two commonly studied con- 
geners with similar ecology: T. japonicus, which is dis- 
tributed around the Asian Pacific rim, and T. 
brevicornis, which is distributed from Portugal to Nova 
Scotia around the Northern Atlantic. Many of the mar- 
kers we develop are also reliably detectable in these two 
congeners, and will be broadly useful as a community 
resource for genetic mapping in these species. The 
development of this linkage map is an essential prelude 
to the more detailed genomic characterization of a 
model copepod. 

Methods 

SNP development 

The majority of SNPs were identified from 454 cDNA 
sequence data for the Santa Cruz (SC, 36°57'N, 122° 
03'W) and San Diego (SD, 32°45'N, 117°15'W) T. califor- 
nicus populations [10]. Sequences were aligned using the 
gsAssembler (Roche) and visualized in Consed [17]. 



Diagnostic SC/SD fixed SNPs were chosen using the fol- 
lowing criteria: contig > 200bp, minimum of 2 reads for 
each population, significant sequence alignment to 
metazoan sequence (BLAST2GO [18]) with an E-value 
< 10' and sufficient conserved flanking regions for 
priming sites. Another subset of potential SNPs were 
identified from known gene sequences obtained from 
Genbank or from unpublished data (R. Burton). For 
each of the markers with a positive genetic identifica- 
tion, the name of the marker is consistent with the 
putative-orthologue name, taken as the most-significant 
alignment using tBlastx at [19]. The one exception is 
TFAM, where the most significant alignment is to a pre- 
dicted gene product in Tribolium castaneum of the gen- 
eral class of transcription factors which includes TFAM. 
Subsequent best alignments are to TFAM orthologues 
in a number of species. 

Primers and extension sequences for iPlex high 
throughput genotyping were developed from this dataset 
by Jeffrey Conroy at the Genomic Shared Resources, Ros- 
well Park Cancer Institute, Buffalo, NY, using the iPlex 
Gold software (Sequenom, San Diego CA). Primer and 
extension sequences are shown in Additional File 1. 201 
SNP assays were designed for multiplexing in a total of 7 
wells, with a maximum of 30 assays in each well. 
Sequences ranged from 15-29bp. Within each well, the 
minimum molecular weight of a sequence was 4567 dal- 
tons, and the maximum weight was 8949.9 daltons 
(Additional File 1). The minimum difference between 
any two products was 16 daltons. Sequences were scored 
for molecular weight in a Sequenom mass spectrometer, 
and SNP identity determined by molecular weight. 

SNPs were validated for 5 adult males and 5 adult 
females from each of the SC and SD lines, and found to 
be fixed for alternate alleles in each population, with 
heterozygotes confirmed in 20 ¥ 1 hybrids. SNPs were 
also tested in 9 additional populations of T. californicus 
(Table 1), and a single population each of T. brevicornis 
and T. japonicus (average n = 5). A panel of 46 pre- 
viously utilized SNPs [15] was also included, including 2 
mitochondrial markers to test for contamination. 

DNA extraction 

Nauplii (larvae) for genotyping were hatched from late- 
stage egg sacs isolated in single drops of seawater in a 
petri plate. The petri plate was flooded with lysis buffer 
(10 mM Tris pH 8.3, 50 mM KC1, 0.5% Tween 20). 
Individual nauplii were pipetted in 2-5 [il of buffer to 
200 \xl PCR tubes containing 10 ul of buffer with 200 
(ig/ml Proteinase K (final concentration). Tubes were 
incubated at 65°C for 1 hr followed by 100°C for 15 min 
and then frozen at -80°C. DNA extraction of adult indi- 
viduals followed a similar procedure except that indivi- 
duals were briefly rinsed in diH 2 0 and blotted dry on 
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Table 1 Patterns of marker segregation among populations of T. californicus and two sister species. 
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Number of fixed differences in SNPs between populations (above diagonal); number of polymorphic SNPs within population (diagonal); number of chromosomes 
between populations potentially distinguishable by the full panel of 190 SNP markers, including fixed differences and polymorphic alleles (below diagonal). 



SD: San Diego, CA, 32°45'N, 117°15'W; SC: Santa Cruz, CA, 36°57'N, 122°03'W; PA: Playa Altamira, Baja California, 28°33'N, 114°05'W; LMC: Los Morros Colorados, 
Baja California, 29°43'N, 115°18'W; RP: Royal Palms, Palos Verdes CA, 33°42'N, 118°19'W; Cat: Catalina Island, CA, 33°27'N, 118 29'W; LC: Leo Carillo Beach, CA, 34' 
03'N, 118°56W; SCI: Santa Cruz Island, CA, 34°03'N, 119°34'W; CAR: Carmel, CA, 36°55'N, 12T93W; AG: Aguilar, BC, 48°51'N, 125°08'W; WIZ: Wizard, BC, 48°85'N, 
125°16W; T.bv: Tigriopus brevicornus; T.jp: Tigriopus japonicus. 



filter paper (egg sacs were gently removed from females) 
before being frozen at -80°C. Adult samples were subse- 
quently extracted in 50 ul lysis buffer with 200 [il/ml 
Proteinase K (final concentration). 20 ul of each sample 
was dried at 60°C for 4 hours before being sent to the 
Roswell Park Cancer Institute for genotyping, and the 
remaining 30 ul was retained at -80°C for future work. 

F 2 mapping population 

Upon maturation, males of T. californicus perform mate 
guarding-holding an immature female until she is 
mature, then mating with her and releasing her. Obser- 
ving clasped dyads facilitates sexing of immature indivi- 
duals, and allows the ready collection of virgins. Females 
are known to mate only a single time in their life, and 
use stored sperm to fertilize multiple clutches of eggs 
[20]. Inbred-isofemale-lines are therefore established by 
isolating a single fertilized female and allowing offspring, 
including those from overlapping generations, to mate 
freely. 

Isofemale lines were established from SC and SD 
populations and were maintained in the laboratory for 
2.5 years. With a minimum generation time of 23 days, 
at 20°C, this corresponds to approximately 39 genera- 
tions of culture. All laboratory stocks were reared in a 
standard medium of thrice-filtered (40 um nitex filters) 
seawater with added growth medium (0.1 g of spirulina 
and 0.1 g of ground Tetramin per litre of seawater). 
Crosses between SD females and SC males were con- 
ducted in 250 ml beakers containing 200 ml of growth 
medium. Six replicate crosses consisted each of 50 



males and 50 females. Males were discarded after 1 
week. Females were removed after 2 weeks, when cope- 
podids (juveniles) became visible. When clasping pairs 
of ¥i offspring were observed, the males and females 
were separated and then united with a partner descend- 
ing from a different beaker to avoid potential full sibling 
mating (see Additional File 2). Single pairs comprising 
an Fj female and an ¥ 1 male were placed in separate 
petri plates containing 12 ml of culture medium. Males 
were removed when an egg sac was observed, and 
females moved to fresh plates as each successive clutch 
hatched. 250 first instar nauplii were collected for geno- 
typing. In previous experiments, little segregation distor- 
tion has been observed in early stage nauplii [15]. 

Additional samples for genotyping 

In addition to the F 2 hybrids we also geno typed 19 non- 
recombinant backcross hybrid adults [(SD female x SC 
male) ¥ 1 female x SD male; see Additional File 2]. 
Because T. californicus females do not undergo recombi- 
nation [16,21] this non-recombinant cross allows confir- 
mation that linkage groups identified from the F 2 
mapping population are on different chromosomes. 
Lastly, between 5 and 10 adult individuals from 9 other 
populations of T. californicus, as well as T. brevicornis 
and T. japonicus were collected for genotyping. 

Mapping 

The use of previously mapped SNPs [15] and cyto- 
chrome-c [22] enabled us to maintain chromosomal 
identity from previous work. Chromosomes 1, 2, 5, 7, 8, 
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9, 10, 11 and 12 were anchored to the linkage groups of 
Harrison and Edmands [23] and Pritchard et al. [15] 
(where chromosome 12 corresponds with linkage group 
D). Our current chromosomes 3, 4 and 6 correspond to 
the linkage groups of A, B, and C respectively of Pritch- 
ard et al. [15] and we were not able to anchor them to 
Harrison and Edmands [23]. 

The linkage map was constructed using the JoinMap 
3.0 software package [24]. The Kosambi mapping func- 
tion was used, with an LOD threshold of 1.0, a recombi- 
nation threshold of 0.4, and default values for all other 
settings. Correct assignment of markers to chromo- 
somes was confirmed using the non-recombinant back- 
cross genotypes. Several markers exhibited substantial 
apparent segregation distortion, departing significantly 
from Hardy Weinberg expectation (% 2 >14, p < 0.001, df 
= 1). We did not exclude these a priori from the analy- 
sis. Following construction of the map, we examined the 
proposed haplotypes and found that, for the 5 most dis- 
torted markers we would have to posit recombination in 
females at levels above 25% between the distorted mar- 
kers and their nearest neighbors to explain the marker 
states. As recombination in females is known to not 
occur at levels >1% [16,21], we excluded these 5 markers 
from the analysis. We were unable to determine the 
source of this apparent genotyping error, but one possi- 
bility is cryptic, previously unidentified polymorphism. 

Predicted effect of SNPS on protein sequence 

Nucleotides that are under lower selective constraint 
might be expected to have higher substitution rates, and 
thus may be expected to have more alleles across popu- 
lations. For all SNPs, we used the probable coding 
sequences of Barreto et al. [10] to determine whether 
the SNP was in a translated region of the transcript, and 
if so, whether or not the SNPs were synonymous. SNP 
positions and translations were calculated by matching 
the Sequenom extension sequences against the tran- 
scripts of Baretto et al [10] using the Bioperl (vl.6.1) 
module of Perl (v 5.12.2). 

Synteny with model species 

We attempted to find signs of macrosynteny with sev- 
eral assembled arthropod genomes-the insects Apis 
mellifera [25], assembly 4.5, with a In = 16; D. melano- 
gaster [26] assembly 4.3, with In = 5 (commonly the 
major chromosomal arms are treated as independent 
linkage groups); and Tribolium castaneum [27] assembly 
3.0 with a In = 12; and the recently sequenced Bran- 
chiopod crustacean, Daphnia pulex [3] beta 3 release 
with a In = 12. Of these, the insect genomes were the 
most fully assembled, and we were able to disregard any 
unanchored scaffolds. In D. pulex, there were far more 
unanchored scaffolds-3804-and we did not exclude any 



(although the majority of the genome is assembled in 
the first 50 scaffolds and the majority of our Blast align- 
ments were expected to localize on these). 

To identify homologs, we performed sequence align- 
ment searches using tBlastx of the flanking sequence for 
each of our mapped markers (cDNA contigs for new 
markers and ESTs for markers from Pritchard et al. [15]) 
against each reference draft genome assembly, with a 
threshold of E-value < 10" and the default settings. Sev- 
eral sequences had multiple alignments exceeding this 
threshold, and in these cases we considered only the 
most significant sequence alignment. The A. mellifera 
amino acid sequences were obtained from[28]. The T. 
castaneum amino acid sequences were obtained from 
[29], and the chromosomal location of the most signifi- 
cant alignments for both the A. mellifera and T. casta- 
neum genomes were obtained by querying the Batch 
Entrez database at [19] with the gi numbers. The D. mel- 
anogaster v. 4.3 sequences were obtained from Flybase 
[30]. The D. pulex amino acid sequences were obtained 
from [31] and the scaffold identity of these sequences 
subsequently obtained by tBlastn to the genomic scaffold 
obtained at the same site. Where a pair of putative homo- 
logs colocalized to a single linkage group in both species, 
we scored it as a single potential syntenic unit. Where 
three markers colocalized in this way, we scored 2 align- 
ments (in no case did more than 3 homologs colocalize 
in both species). We summed all potential syntenic units 
for each species (Table 2). Significance was determined 
by randomly permuting the homolog locations relative to 
the T. californicus chromosomal map positions 1000 
times in R [32], and counting the number of estimated 
syntenic units each time. 

Results 

SNP development 

Of the total of 245 nuclear SNPs we assayed, 190 SNPs 
were found to be reliable in our genotyping pools, 
including 38 of the previously developed markers [15]. 
Only these 190 SNP markers are reported here (Addi- 
tional File 1). In nauplii, none of these markers were 
found to be significantly distorted after Bonferroni cor- 
rection. Primers and extension sequences were found to 
be widely conserved within populations of T. californi- 
cus, as well as sister species T. brevicornis and T. japoni- 
cus. Many of the SNPs identified in the SC-SD 
comparison were polymorphic within other populations 
(Table 1, diagonal, bold; Additional File 1). The number 
of fixed differences between populations, as well as seg- 
regating polymorphisms within populations, together 
represent the potential pool of markers for mapping in 
T. californicus. The number of potential markers for use 
in mapping differences between population pairs 
appears in the above-diagonal of Table 1. 
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Table 2 Number of pairs of colocalizing loci in both Tigriopus califomicus and assembled reference genomes. 
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The number of linkage groups of the draft reference genomes of each species are shown, with the number of identified homologues between T. califomicus and 
each of the reference genomes, and the number of homologue pairs which colocalize to a linkage group in both T. califomicus and the reference. The maximum 
(threshold) number of colocalized pairs we would expect to see by chance, and the significance of the observed number of colocalized pairs were determined by 
permutation. 



Mapping 

The final assembly consisted of 12 linkage groups, cor- 
responding to the known 12 chromosomes of T. cali- 
fomicus (Figure 1). These 12 linkage groups were 
confirmed by results from the non-recombinant back- 
cross. The average intermarker distance was 1.81 cM, 
and the maximum distance between adjacent markers 
was 10.3 cM. The sum of the distances between the 
markers gives an estimated linkage map length of 
323.5 cM long. Applying the suggested correction 
method [number 4 in 33] to estimate total map length, 
where L[ is map length of a given chromosome i, and 
mi is the number of markers for that linkage group, 



hot = T,ii L i+2) x (m+ l)/(m- 1) = 401.0 cM We esti- 
mated total coverage following [34]. Given total mar- 
ker number n = 190, and average intermarker distance 
d = 1.81 cM, the proportion coverage 
c = l — e -2dn/L lol _ 0.737. The recombination rate, given 
the estimated haploid genome size of 244.5Mb [35,36], 
corresponds approximately 1.6 cM/Mbp. The sizes of 
the linkage groups ranged from 19 to 32 cM. 

Predicted effect of SNPs on protein sequence 

Of the 190 SNPs used to construct the linkage map, 125 
(66%) are predicted to be translated, and of these 125, 
101 (79.5%) are synonymous substitutions. Of these 101 
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Figure 1 linkage map of Tigriopus califomicus, in centimorgans. 
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loci, there are other synonymous substitutions possible 
at 80 loci (Additional File 1). 

Synteny with model species 

Orthologues to 153 of the 190 T. californicus sequences 
were detected in the reference genomes (Additional File 
3). The greatest number of detected orthologues (144) 
was with reference to the D. melanogaster genome, and 
the lowest (72) was with D. pulex. There was evidence 
of macrosynteny between T. californicus and the A. mel- 
lifera genome. Nine pairs of orthologous genes were 
found to colocalize on chromosomes in both species, 
above the maximum of 7 pairs we might expect to find 
colocalizing by chance. After applying a Bonferroni cor- 
rection for 4 tests, for each of the reference genomes, 
the confidence interval for the number of syntenic units 
shared between the two species is still below the p = 
0.0125 threshold for significance as determined by per- 
mutation (Table 2, Table 3). 

Discussion 

We successfully mapped the T. californicus nuclear gen- 
ome, using 38 previously described SNPs, and 152 novel 
SNPs, to a total of 190 markers. This corresponds to an 
average of 1 marker every 1.29 Mbp, and an average 
intermarker distance of 1.8 cM. All markers were 
assigned to linkage groups, and the maximum intermar- 
ker distance was 10.3 cM. We will thus be able to use 
these markers to impute genotypes in interval mapping 
with 95% accuracy or better for all positions in the gen- 
ome. The 12 linkage groups obtained corresponded to 
the 12 known chromosomes of T. californicus, and most 
were anchored to linkage groups previously identified 
with microsatellite and Amplified Fragment Length 
Polymorphism (AFLP) markers [23]. The total corrected 
map length of 401.0 cM was somewhat lower than the 
estimated map length of Pritchard et al. [15], and the 
estimated total coverage of 73.7% was also somewhat 
lower than their 75.2% coverage, but this difference in 

Table 3 Chromosomes with colocalizing homologous 
sequences in T. californicus and A. mellifera. 



T. californicus chromosome A. mellifera chromosomes 



1 


1(3) 


14(2) 






2 


4(2) 


11(2) 


12(2) 


15(3) 


3 


1(2) 


6(3) 


11(2) 




>\ 


15(2) 








5 


1(4) 


2(3) 


15(2) 




6 


8(3) 








7 


1(2) 


10(3) 


11(3) 




8 


1(4) 


6(2) 


8(3) 


10(2) 


9 


7(2) 


11(3) 


16(2) 




10 


1(3) 


3(2) 


4(2) 


5(3) 



estimates is unsurprising given the much deeper cover- 
age we have achieved in the current study. 

All crosses between populations of T. californicus 
from Southern California to British Columbia produce 
fertile F x progeny [37]. Pairs of populations, however, 
exhibit increasing F 2 hybrid breakdown by distance- 
populations as close as 100 km can have a reduction in 
fitness of 50% in the F 2 relative to the parental popula- 
tions [13,37]. We found no evidence of segregation dis- 
tortion in the nauplii of our mapping population, 
however. This is consistent with previous studies which 
found generally undistorted allele frequencies in the 
early larval stage, while adults had allele frequencies 
which strongly departed from expected frequencies 
[15,38]. 

Primer sequences used for polymerase chain reaction 
DNA amplifications were largely conserved between 
populations. In only one population (PA) did more than 
5 primer sets fail to amplify a product, and we found 
many SNPs segregating within and between populations. 
The elevated differentiation between PA and the South- 
ern Californian populations was not unexpected, as Baja 
populations of T. californicus have been found to be 
very divergent in mitochondrial haplotype [13], and to 
exhibit high rates of hybrid breakdown when crossed 
with northern populations [39]. Many of our primers 
also amplified products in the sister species T. japonicus 
and T. brevicornis. The SNP markers we have character- 
ized here, therefore will be generally useful for QTL 
mapping across the entire geographic range of T. califor- 
nicus. For all but a few population pairs, it will be possi- 
ble to construct mapping lines and distinguish every 
chromosome in a non-recombinant backcross design. 

Additional SNPs may be found to segregate in these 
populations, since they were only genotyped for the 
known SD and SC SNPs (or SC, SD and a third popula- 
tion in the case of the SNPs from Pritchard et al. [15]). 
Only 20 of the SNP markers were in coding regions at 
twofold degenerate sites, suggesting as many as 89.5% of 
the SNPs may be under relaxed selective constraint. 
Populations will thus need to be characterized for alter- 
nate alleles for mapping purposes. The known level of 
triallely of SNPs is very low in T. californicus, however. 
V. Pritchard (pers. comm) found that between San 
Diego, Santa Cruz and Punta Baja (Baja California), 
levels of triallely are approximately 2% at SNP sites 
across the genome. These 3 populations are deeply 
divergent within T. californicus, suggesting that within- 
species at least, there will be few uncharacterized alleles. 

Compared with the other arthropods, the recombina- 
tion rate of T. californicus (1.6 cM/Mbp) is low 
[reviewed in 39], but is more than sufficient for map- 
ping in F 2 , backcross and other linkage mapping 
approaches. It is similar to that of Diptera(mean = 1.03 
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cM/Mbp) and shrimp Penaeus monodon (mean = 1.01 
cM/Mbp)[5], but it is approximately 21% the estimated 
recombination rate of Daphnia pulex at 7.52 cM/Mbp 
[40]. It is much lower than Hymenoptera (mean = 7.12 
cM/Mbp), and somewhat lower than the more typical 
arthropod recombination rate of Coleoptera (mean = 
2.48 cM/Mbp) [41]. It is known that recombination 
rates in groups with alternate life histories are under 
selection-for instance, social vs. solitary Hymenoptera 
have greatly elevated recombination rates. The relatively 
high recombination rate of D. pulex compared to 
shrimp and T. californicus is possibly due to its repro- 
ductive mode-many generations of parthenogenesis, 
punctuated by episodic sex, may have consequences on 
the evolution of recombination rate [42]. Another rea- 
son for the relatively low recombination rate might be 
that recombination occurs only in males in T. californi- 
cus [16,21]. 

Additionally, perhaps surprisingly, there was some evi- 
dence of macrosynteny between A. mellifera and T. cali- 
fornicus, although there was no one-to-one 
correspondence between chromosomes. No evidence of 
synteny was found T. californicus and D. pulex or D. 
melanogaster; however, this may be because these gen- 
omes have undergone higher rates of chromosomal rear- 
rangements. It was reported that no evidence of synteny 
was found between D. pulex and D. melanogaster either 
[40]. In fact, Critescu et al. [40] suggest-based on the 
rearrangement rate of Drosophila-that synteny should 
only be apparent between crustaceans and insects at 
scales smaller than 100 kb. It may be that Hymenoptera 
and Copepoda have lower rates of large-scale chromoso- 
mal fission and fusion than Drosophila and the other 
assayed arthropod lineages. 

Conclusion 

We have mapped the T. californicus genome to a level 
of coverage that will enable common mapping techni- 
ques such as nonrecombinant backcross, and F 2 or 
Recombinant Inbred Line mapping-greatly expanding 
the possible scope of work in this species. The markers 
and map developed will potentially be useful more 
broadly in the genus Tigriopus. The phylogenetic posi- 
tion of T. californicus, its compact genome size, and its 
tractability in the lab have all positioned it as an excel- 
lent model crustacean system. The continued develop- 
ment of genomic tools such as this linkage map will 
enable future exciting work in this species. 

Abbreviations used 

A. mellifera: Apis mellifera; D. pulex: Daphnia pulex; P. 
monodon: Penaeus monodon; T. castaneum: Tribolium 
castaneum; QTL: Quantitative Trait Locus; SNP: Single 
Nucleotide Polymorphism; T. californicus/japonicus/ 



brevicornis: Tigriopus spp.; cDNA: coding DNA. Popula- 
tions: AG: Aguilar, BC, 48°51'N, 125°08'W; CAR: Car- 
mel, CA, 36°55'N, 121°93W; Cat: Catalina Island, CA, 
33°27'N, 118 29'W; LC: Leo Carillo Beach, CA, 34°03'N, 
118°56W; LMC: Los Morros Colorados, Baja California, 
29°43'N, 115°18'W; PA: Playa Altamira, Baja California, 
31°52'N, 116°40'W; RP: Royal Palms, Palos Verdes CA, 
33°42'N, 118°19'W; SC: Santa Cruz, CA 36°57'N, 122° 
03'W; SD: San Diego, CA, 32°45'N, 117°15'W; SCI: 
Santa Cruz Island, CA, 34°03'N, 119°34'W; WIZ: 
Wizard, BC, 48°85'N, 125°16W. 
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Additional File 1: "Table of markers used to create the linkage map 

Showing calculated map positions, forward and reverse primer 
sequences, extension sequences with associated SD and SC alleles. 
Marker genotypes of a wide geographic sampling of populations, as well 
as sister species, are shown. Also shown is the sequence description 
from the most significant alignment obtained by Blast search (GenBank). 
Inferred San Diego and Santa Cruz triplet codons are shown, with the 
SNP position and amino acid translation. If SNPs are predicted to be 
synonymous, the number of additional synonymous alleles at the SNP 
site is indicated." 

Additional File 2: "Crossing scheme between San Diego (SD) and 
Santa Cruz (SC) to generate F2 and non-recombinant backcross 
(nrBC) mapping lines in Tigriopus californicus " 

Additional File 3: "Table of most significant tBIastx alignments IE- 
value < 10 3 ) between T. californicus sequences and the reference 
draft genome assemblies of model species Apis mellifera, Tribolium 
castaneum, Drosophila melanogaster and Daphnia pulex. For each 
significant alignment are shown the gi number, the E-value of the 
alignment, and the linkage information of the homologue. 



Acknowledgements and funding 

This work was funded by U.S. National Science Foundation grants DEB- 
0743472 to SE and DEB- 0717178 to RSB, and by a USC Women in Science 
and Engineering grant to SE. Many thanks to Jeffrey Conroy for his help 
with the SNP assays and to Grace Jang and Tara Matsuda for laboratory 
assistance. 

Author details 

department of Biological Sciences, University of Southern California, Los 
Angeles, CA 90089-0371, USA. 2 Windward School, Los Angeles, CA 90066- 
2104, USA. department of Organismic and Evolutionary Biology, Harvard 
University, Cambridge, MA 01730, USA. 4 Marine Biology Research Division, 
Scripps Institution of Oceanography, University of California, San Diego, La 
Jolla, CA 92037, USA. 

Authors' contributions 

SE designed the study, assisted with crosses, oversaw analyses and 
contributed to the final written manuscript. GWM and RSB generated 
transcriptome data. CGR developed the SNP assays, constructed the linkage 
map and assisted with crosses. DER and WL assisted with crosses. BRF 
performed all post-map analyses and wrote the manuscript. All authors 
contributed comments and approved the manuscript prior to submission. 

Received: 24 May 201 1 Accepted: 21 November 201 1 
Published: 21 November 201 1 

References 

1. Humes AG: How many copepods? Hydrobiologia 1994, 292-293:1-7. 

2. Turner JT: Pelagic Marine Food Webs. Zoological Studies 2004, 43:255-266. 



Foley ef al. BMC Genomics 201 1, 12:568 Page 8 of 8 

http://www.biomedcentral.eom/1 471 -2 1 64/1 2/568 



3. Colbourne JK, Pfrender ME, Gilbert D, Thomas WK, Tucker A, Oakley TH, 
Tokishita S, Aerts A, Arnold GJ, Basu MK, Bauer DJ, Caceres CE, Carmel L, 
Casola C, Choi J-H, Dong Q, Dusheyko S, Eads BD, Frohlich T, Geiler- 
Samerotte KA, Gerlach D, Hatcher P, Jogdeo S, Krijgsveld J, Kriventseva EV, 
Kultz D, Laforsch C, Lindquist E, Lopez J, et al: The Ecoresponsive Genome 
of Daphnia pulex. Science 201 1, 331:555-561. 

4. Du ZQ, Ciobanu DC, Onteru SK, Gorbach D, Mileham AJ, Jaramillo G, 
Rothschild MF: A gene-based SNP linkage map for pacific white shrimp, 
Litopenaeus vannamei. Animal genetics 2010, 41:286-94. 

5. You EM, Liu KF, Huang SW, Chen M, Groumellec ML, Yu HT: Construction 
of integrated genetic linkage maps of the tiger shrimp (Penaeus 
monodon) using microsatellite and AFLP markers. Animal genetics 2010, 
41:365-76. 

6. Otto SP, Whitton J: Polyploid incidence and evolution. Annual Review of 
Genetics 2000, 34:401-437. 

7. Koyama T, Asakawa S, Katagiri T, Shimizu A, Fagutao FF, Mavichak R, 
Santos MD, Fuji K, Sakamoto T, Kitakado T, Kondo H, Shimizu N, Aoki T, 
Hirono I: Hyper-expansion of large DNA segments in the genome of 
kuruma shrimp, Marsupenaeus japonicus. BMC genomics 2010, 1 1:141. 

8. Lecher P, Defaye D, Noel P: Chromosomes and nuclear DNA of Crustacea. 
Invertebrate Reproduction & Development 1995, 27:85-1 14. 

9. Raisuddin S, Kwok KWH, Leung KMY, Schlenk D, Lee J-S: The copepod 
Tigriopus: a promising marine model organism for ecotoxicology and 
environmental genomics. Aquatic toxicology (Amsterdam, Netherlands) 
2007, 83:161-73. 

10. Barreto FS, Moy GW, Burton RS: Interpopulation patterns of divergence 
and selection across the transcriptome of the copepod Tigriopus 
californicus. Molecular ecology 201 1, 20:560-72. 

11. Burton RS, Metz EC, Flowers JM, Willett CS: Unusual structure of ribosomal 
DNA in the copepod Tigriopus californicus: intergenic spacer sequences 
lack internal subrepeats. Gene 2005, 344:105-13. 

12. Willett CS: Potential fitness trade-offs for thermal tolerance in the 
intertidal copepod Tigriopus californicus. Evolution 2010, 2521-2534. 

1 3. Edmands S: Phylogeography of the intertidal copepod Tigriopus 
californicus reveals substantially reduced population differentiation at 
northern latitudes. Molecular ecology 2001, 10:1743-50. 

14. Burton RS, Feldman MW, Curtsinger JW: Population genetics of Tigriopus 
californicus (Copepoda: Harpacticoida): I. Population structure along the 
central California coast. Marine Ecology Progress Series 1979, 1:29-39. 

15. Pritchard VL, Dimond L, Harrison JS, Velazquez CCS, Zieba JT, Burton RS, 
Edmands S: Interpopulation hybridization results in widespread viability 
selection across the genome in Tigriopus californicus. BMC Genetics 201 1, 
12:54. 

16. Ar-Rushdi AH: The cytology of achiasmatic meiosis in the female 
Tigriopus (Copepoda). Chromosoma 1963, 13:526. 

1 7. Gordon D, Abajian C, Green P: Consed: a graphical tool for sequence 
finishing. Genome Research 1998, 8:195-202. 

18. Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, Robles M, 
Talon M, Dopazo J, Conesa A: High-throughput functional annotation and 
data mining with the Blast2GO suite. Nucleic Acids Res 2008, 
36(10):3420-3435. 

1 9. The National Center for Biotechnology Information website. [http://ncbi. 
nih.gov]. 

20. Burton RS: Mating system of the intertidal copepod Tigriopus californicus. 
Marine Biology 1985, 86:247-252. 

21. Burton RS, Feldman M, Swisher S: Linkage relationships among five 
enzyme-coding gene loci in the copepod Tigriopus californicus: a genetic 
confirmation of achiasmiatic meiosis. Biochemical genetics 1981, 
19:1237-45. 

22. Edmands S, Northrup SL, Hwang AS: Maladapted gene complexes within 
populations of the intertidal copepod Tigriopus californicus! Evolution 
2009, 63:2184-92. 

23. Harrison JS, Edmands S: Chromosomal basis of viability differences in 
Tigriopus californicus interpopulation hybrids. Journal of evolutionary 
biology 2006, 19:2040-51. 

24. van Ooijen J, Voorrips R: JoinMap' 3.0, Software for the calculation of 

genetic linkage maps. Plant Research International 2001. 

25. Weinstock GM, Robinson GE, Gibbs RA, Worley KC, Evans JD, Maleszka R, 
Robertson HM, Weaver DB, Beye M, Bork P, Elsik CG, Hartfelder K, Hunt GJ, 
Zdobnov EM, Amdam GV, Bitondi MMG, Collins AM, Cristino AS, Michael H, 
Lattorff G, Lobo CH, Moritz RFA, Nunes FMF, Page RE, Simes ZLP, 



Wheeler D, Carninci P, Fukuda S, Hayashizaki Y, Kai C, Kawai J, et al: Insights 
into social insects from the genome of the honeybee Apis mellifera. 
Nature 2006, 443:931-949. 

26. Adams MD: The Genome Sequence of Drosophila melanogaster. Science 
2000, 287:2185-2195. 

27. Richards S, Gibbs RA, Weinstock GM, Brown SJ, Denell R, Beeman RW, 
Gibbs R, Bucher G, Friedrich M, Grimmelikhuijzen CJP, Klingler M, 
Lorenzen M, Roth S, Schroder R, Tautz D, Zdobnov EM, Muzny D, 
Attaway T, Bell S, Buhay CJ, Chandrabose MN, Chavez D, Clerk- 
Blankenburg KP, Cree A, Dao M, Davis C, Chacko J, Dinh H, Dugan-Rocha S, 
Fowler G, et al: The genome of the model beetle and pest Tribolium 
castaneum. Nature 2008, 452:949-55. 

28. The NCBI FTP site for Apis mellifera amino acid sequences, [ftp://ftp.ncbi. 
nih.gov/genomes/Apis_mellifera/protein], 

29. Brown SJ, Caragea D: KSU Bioinformatics Center.[ftp://bioinformatics.ksu. 
edu/pub/BeetleBase/3.0/Sequences/Manually_Curated_CDS_Peptide_mRNA/ 
Manually_Curated_Peptide]. 

30. Flybase Genomes FTP site. [ftp://ftp.flybase.net7genomes/aaa/dmel_r4.3]. 

31. Indiana University Bio-Archive, [ftp://iubio.bio.indiana.edu/daphnia/ 
genome/Daphnia_pulex]. 

32. R Development Core Team: R: A language and environment for statistical 
computing. 2010. 

33. Chakravarti A, Lasher LK, Reefer JE: A maximum likelihood method for 
estimating genome length using genetic linkage data. Genetics 1991, 
128:175-82. 

34. Turelli M, Orr HA: Dominance, epistasis and the genetics of postzygotic 

isolation. Genetics 2000, 154:1663-79. 

35. Wyngaard GA, Rasch EM: Patterns of genome size in the copepoda. 
Hydrobiologia 2000, 417:43-56. 

36. Gregory TR, Nicol JA, Tamm H, Kullman B, Kullman K, Leitch U, Murray BG, 
Kapraun DF, Greilhuber J, Bennett MD: Eukaryotic genome size databases. 
Nucleic acids research 2007, 35:D332-8[http://www.genomesize.com]. 

37. Edmands S: Heterosis and outbreeding depression in interpopulation 
crosses spanning a wide range of divergence. Evolution 1999, 
53:1757-1768. 

38. Willett CS, Berkowitz JN: Viability effects and not meoitic drive cause 
dramatic departures from Mendelian inheritance for malic enzyme in 
hybrids of Tigriopus californicus populations. Journal of evolutionary 
biology 2007, 20:1196-205. 

39. Ganz HH, Burton RS: Genetic differentiation and reproductive 
incompatibility among Baja California populations of the copepod 
Tigriopus californicus. Marine Biology 1995, 123:821-827. 

40. Cristescu MEA, Colbourne JK, Radivojac J, Lynch M: A microsatellite-based 
genetic linkage map of the waterflea, Daphnia pulex: On the prospect of 
crustacean genomics. Genomics 2006, 88:415-30. 

41. Wilfert L, Gadau J, Schmid-Hempel P: Variation in genomic recombination 
rates among animal taxa and the case of social insects. Heredity 2007, 
98:189-97. 

42. Lande R: The genetic correlation between characters maintained by 
selection, linkage and inbreeding. Genetical research 1984, 44:309-320. 



doi:10.1 186/1471-2164-12-568 

Cite this article as: Foley ef al:. A gene-based SNP resource and linkage 
map for the copepod Tigriopus californicus. BMC Genomics 201 1 12:568. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www.biomedcentral.com/submit 



BioMed Central 



